33 research outputs found

    Future value based single assignment program representations and optimizations

    Get PDF
    An optimizing compiler internal representation fundamentally affects the clarity, efficiency and feasibility of optimization algorithms employed by the compiler. Static Single Assignment (SSA) as a state-of-the-art program representation has great advantages though still can be improved. This dissertation explores the domain of single assignment beyond SSA, and presents two novel program representations: Future Gated Single Assignment (FGSA) and Recursive Future Predicated Form (RFPF). Both FGSA and RFPF embed control flow and data flow information, enabling efficient traversal program information and thus leading to better and simpler optimizations. We introduce future value concept, the designing base of both FGSA and RFPF, which permits a consumer instruction to be encountered before the producer of its source operand(s) in a control flow setting. We show that FGSA is efficiently computable by using a series T1/T2/TR transformation, yielding an expected linear time algorithm for combining together the construction of the pruned single assignment form and live analysis for both reducible and irreducible graphs. As a result, the approach results in an average reduction of 7.7%, with a maximum of 67% in the number of gating functions compared to the pruned SSA form on the SPEC2000 benchmark suite. We present a solid and near optimal framework to perform inverse transformation from single assignment programs. We demonstrate the importance of unrestricted code motion and present RFPF. We develop algorithms which enable instruction movement in acyclic, as well as cyclic regions, and show the ease to perform optimizations such as Partial Redundancy Elimination on RFPF

    Higher-order spin-hole correlations around a localized charge impurity

    Get PDF
    Analysis of higher-order correlation functions has become a powerful tool for investigating interacting many-body systems in quantum simulators, such as quantum gas microscopes. Experimental measurements of mixed spin-charge correlation functions in the 2D Hubbard have been used to study equilibrium properties of magnetic polarons and to identify coherent and incoherent regimes of their dynamics. In this paper we consider theoretically an extension of this technique to systems which use a pinning potential to reduce the mobility of a single dopant in the Mott insulating regime of the 2D Hubbard model. We find that localization of the dopant has a dramatic effect on its magnetic dressing. The connected third order spin correlations are weakened in the case of a mobile hole but strengthened near an immobile hole. In the case of the fifth-order correlation function, we find that its bare value has opposite signs in cases of the mobile and of fully pinned dopant, whereas the connected part is similar for both cases.We study suppression of higher-order correlators by thermal fluctuations and demonstrate that they can be observed up to temperatures comparable to the spin-exchange energy J. We discuss implications of our results for understanding the interplay of spin and charge in doped Mott insulators

    Dynamic Contrastive Distillation for Image-Text Retrieval

    Get PDF
    Although the vision-and-language pretraining (VLP) equipped cross-modal image-text retrieval (ITR) has achieved remarkable progress in the past two years, it suffers from a major drawback: the ever-increasing size of VLP models restrict its deployment to real-world search scenarios (where the high latency is unacceptable). To alleviate this problem, we present a novel plug-in dynamic contrastive distillation (DCD) framework to compress the large VLP models for the ITR task. Technically, we face the following two challenges: 1) the typical uni-modal metric learning approach is difficult to directly apply to cross-modal task, due to the limited GPU memory to optimize too many negative samples during handling cross-modal fusion features. 2) it is inefficient to static optimize the student network from different hard samples, which have different effects on distillation learning and student network optimization. We try to overcome these challenges from two points. First, to achieve multi-modal contrastive learning, and balance the training costs and effects, we propose to use a teacher network to estimate the difficult samples for students, making the students absorb the powerful knowledge from pre-trained teachers, and master the knowledge from hard samples. Second, to dynamic learn from hard sample pairs, we propose dynamic distillation to dynamically learn samples of different difficulties, from the perspective of better balancing the difficulty of knowledge and students' self-learning ability. We successfully apply our proposed DCD strategy on two state-of-the-art vision-language pretrained models, i.e. ViLT and METER. Extensive experiments on MS-COCO and Flickr 30 K benchmarks show the effectiveness and efficiency of our DCD framework. Encouragingly, we can speed up the inference at least 129 × compared to the existing ITR models. We further provide in-depth analyses and discussions that explain where the performance improvement comes from. We hope our work can shed light on other tasks that require distillation and contrastive learning

    Strong Inter-valley Electron-Phonon Coupling in Magic-Angle Twisted Bilayer Graphene

    Full text link
    The unusual properties of superconductivity in magic-angle twisted bilayer graphene (MATBG) have sparked enormous research interest. However, despite the dedication of intensive experimental efforts and the proposal of several possible pairing mechanisms, the origin of its superconductivity remains elusive. Here, using angle-resolved photoemission spectroscopy with micrometer spatial resolution, we discover replicas of the flat bands in superconducting MATBG unaligned with its hexagonal boron nitride (hBN) substrate, which are absent in non-superconducting MATBG aligned with the hBN substrate. Crucially, the replicas are evenly spaced in energy, separated by 150 +- 15 meV, signalling the strong coupling of electrons in MATBG to a bosonic mode of this energy. By comparing our observations to simulations, the formation of replicas is attributed to the presence of strong inter-valley electron-phonon coupling to a K-point phonon mode. In total, the observation of these replica flat bands and the corresponding phonon mode in MATBG could provide important information for understanding the origin and the unusual properties of its superconducting phase.Comment: 17 pages, 4 figure

    Anomalous excitonic phase diagram in band-gap-tuned Ta2Ni(Se,S)5

    Full text link
    During a band-gap-tuned semimetal-to-semiconductor transition, Coulomb attraction between electrons and holes can cause spontaneously formed excitons near the zero-band-gap point, or the Lifshitz transition point. This has become an important route to realize bulk excitonic insulators -- an insulating ground state distinct from single-particle band insulators. How this route manifests from weak to strong coupling is not clear. In this work, using angle-resolved photoemission spectroscopy (ARPES) and high-resolution synchrotron x-ray diffraction (XRD), we investigate the broken symmetry state across the semimetal-to-semiconductor transition in a leading bulk excitonic insulator candidate system Ta2Ni(Se,S)5. A broken symmetry phase is found to be continuously suppressed from the semimetal side to the semiconductor side, contradicting the anticipated maximal excitonic instability around the Lifshitz transition. Bolstered by first-principles and model calculations, we find strong interband electron-phonon coupling to play a crucial role in the enhanced symmetry breaking on the semimetal side of the phase diagram. Our results not only provide insight into the longstanding debate of the nature of intertwined orders in Ta2NiSe5, but also establish a basis for exploring band-gap-tuned structural and electronic instabilities in strongly coupled systems.Comment: 27 pages, 4 + 9 figure

    Unrestricted code motion: A program representation and transformation algorithms based on future values

    No full text
    We introduce the concept of future values. Using future values it is possible to represent programs in a new control-flow form such that on any control flow path the data-flow aspect of the computation is either traditional (i.e., definition of a value precedes its consumers), or reversed (i.e., consumers of a value precede its definition). The representation hence allows unrestricted code motion since ordering of instructions are not prohibited by the data dependencies. We present a new program representation called Recursive Future Predicated Form (RFPF) which implements the concept. RFPF subsumes general if-conversion and permits unrestricted code motion to the extent that the whole procedure can be reduced to a single block. We develop algorithms which enable instruction movement in acyclic as well as cyclic regions and give examples of various optimizations in RFPF form. © 2010 Springer-Verlag

    Single assignment compiler, single assignment architecture: Future gated single assignment form*; Static single assignment with congruence classes

    No full text
    We present a new static single assignment form which can be used by an optimizing compiler as its internal representation and the micro-architecture as its instruction set. This representation, Future Gated Single Assignment Form (FGSA), directly represents the use-def relationship of variables by employing the concept of congruence classes and the concept of future dependencies. We show that FGSA is efficiently computable by using a series of T1/T2 transformations, yielding an expected linear time algorithm for the construction of single assignment form. Our interval analysis method includes a novel transformation TR which eliminates irreducible loops without node splitting and combines computation of single-assignment form with irreducible loop elimination. The algorithm produces pruned single assignment form, rendering a separate pruning step unnecessary. In practice, the FGSA approach results in an average reduction of 7.7%, with a maximum of 67% in the number of gating functions compared to the pruned SSA form on the SPEC2000 benchmark suite, owing to its ability to represent dataflow within a congruence class by using a single gating function. We illustrate that FGSA is convenient to use as an internal representation in an optimizing compiler by presenting two case studies of optimization algorithms on FGSA

    Simultaneous Determination of Three Effective Components in Different Compatibility Proportion of the Couple of Sophora flavescens-Fructus Ligustri lucidi by HPLC

    No full text
    Sophora flavescens - Ligustrum lucidum has the effect of increasing the anti-cancer effect of white blood cells and treating liver fibrosis. In order to determine the optimal compatibility ratio of this drug pair, the rationality of the drug pairs in different proportions was revealed. In this experiment, three active ingredients of the Sophora flavescens-Fructus Ligustri lucidi were extracted by reflux extraction, which were matrine, oxymatrine and specnuezhenide. A method for simultaneously determining the content of matrine, oxymatrine and specnuezhenide in a short time by HPLC was established. The method is simple, precise, and stable. Reproducibility can be used to determine the content of three active ingredients of matrine, oxymatrineand specnuezhenide. The analysis results was indicated that the different proportions of compatibility and different extraction solvents have significant effects on the content of active ingredients and dissolution rate of Sophora flavescens- Fructus Ligustri lucidi, after the combination of Sophora flavescens- Fructus Ligustri lucidi in different proportions, the content of matrine, oxymatrine and specnuezhenide is greater than the content of the active ingredients of each single herb. When the ratio of compatibility of Sophora flavescens-Ligustrum Ligustri lucidi is <1:1, it is beneficial to the dissolution of the three active ingredients. By determining the optimal compatibility ratio of this drug pair, provided a theoretical basis for the study of late pharmacodynamics and provide theoretical basis for clinicalapplication of traditional Chinese medicine
    corecore